Towards large scale peer-to-peer web search
نویسنده
چکیده
Web search engines, such as Google and Yahoo, are based on the centralized database model. Search engines using the centralized database model suffer from a several drawbacks, such as: they have a single point of failure, a limited representation of the web, their index is not up-to-date, and scalability. Currently a lot of research is being done on using peer-to-peer (P2P) technology for the use of full-text search in order to overcome the issues centralized search engines suffer from. Although P2P systems have proven to be highly scalable in file sharing applications, this is not so obvious for large scale full-text search. In this paper I will discuss and compare some of the most important P2P architectures based on their literature. Based on this, I will set out the direction future P2P research should be heading to, in order to make large scale P2P web search possible.
منابع مشابه
Is P2P a Suitable Architecture for Large-Scale Web Search?
In this note we discuss the feasibility of a peer-to-peer based infrastructure for large-scale web search that could provide an alternative to current centralized engines such as Google. We first outline the structure and performance bottlenecks of current search engines, and then discuss how to construct a peer-to-peer based engine. Our contention is that for the time being, a peer-to-peer sol...
متن کاملSEARCH ENGINE IN LARGE - SCALE PEER - TO - PEER SYSTEMS by AKSHAY LAL
LAL, AKSHAY. Dgoogle: A Full-Text Search Engine in Large-Scale Peer-to-Peer Systems. (Under the direction of Professor Khaled Harfoush). Full-text search engines like Google serve an important role in accessing Internet resources. In such engines, a search for web pages, matching a user’ s query, are typically carried on a set of co-administered, physically co-located clusters of servers. Full-...
متن کاملTowards Virtual Knowledge Communities in Peer-to-Peer Networks
As a result of the anonymity in todays Web search, it is not possible to receive a personalized search result. Neither prior search results nor search results from other users are taken into consideration. In order to resolve this anonymity towards the search engine, a system is created which locally stores the search results in the scope of a peerto-peer network. Using the Peer Search Memory (...
متن کاملTowards Self-Organizing Query Routing and Processing for Peer-to-Peer Web Search
The peer-to-peer computing paradigm is an intriguing alternative to Google-style search engines for querying and ranking Web content. In a network with many thousands or millions of peers the storage and access load requirements per peer are much lighter than for a centralized Google-like server farm; thus more powerful techniques from information retrieval, statistical learning, computational ...
متن کاملSearch Result Caching in Peer-to-Peer Information Retrieval Networks
For peer-to-peer web search engines it is important to quickly process queries and return search results. How to keep the perceived latency low is an open challenge. In this paper we explore the solution potential of search result caching in large-scale peer-to-peer information retrieval networks by simulating such networks with increasing levels of realism. We nd that a small bounded cache o e...
متن کامل